(54) GENE CODING FOR ENDOGLUCANASE

(57) Abstract:

PROBLEM TO BE SOLVED: To obtain a new gene that codes for an
endoglucanase containing a specific amino acid sequence. The gene is
useful for producing the enzyme, which contributes to the effective
utilization of cellulose-based biomass, and for improving the enzyme
by elucidating its specific characteristics.
SOLUTION: This gene is a new DNA comprising a base sequence that codes
for either the amino acid sequence of formula I (wherein X is absent
or is a leader sequence containing formula II), which shows
endoglucanase activity, or an amino acid sequence that is obtained by
adding, deleting or substituting one or more amino acids in that
sequence and that shows endoglucanase activity. The DNA is useful for
biochemical analysis and highly efficient production of the enzyme,
which contributes to the effective utilization of cellulose-based
biomass, and for improving the specific characteristics of the enzyme
through their elucidation or through gene modification. The DNA is
obtained by culturing Corticium rolfsii, recovering the microbial
cells, extracting the RNA, preparing a cDNA library from this RNA, and
then screening the library with an anti-endoglucanase serum.
Sequence length: 390

Met Phe Ile Pro Ile Ala Leu Val Ala Leu Ala Ala Ser Val Val Asn Ala Gln
        -15                 -10                 -5                  1

Gln Ser Ala Trp Gly Gln Cys Gly Gly Gln Gly Trp Thr Gly Ala Thr Ser Cys
            5                   10                  15

Ile Ser Gly Tyr Tyr Cys Gln Ala Gln Asn Ser Tyr Tyr Ser Gln Cys Val Pro
20                  25                  30                  35

Gly Thr Ala Thr Ser Lys Thr Ala Arg Ser Thr Ser Thr Ala Pro Ser Ser Thr
        40                  45                  50                  55

Gly Ser Ser Gly Ala Arg Leu Pro Tyr Leu Gly Gly Val Asn Thr Ala Gly Tyr
                60                  65                  70

Asp Phe Thr Val Asp Thr Thr Gly Thr Phe Thr Gly Thr Gly Val Val Pro Pro
    75                  80                  85                  90

Ala Ser Gln Tyr Ala His Phe Ala Asn Glu Gly Ala Asn Leu Phe Arg Ile Pro
            95                  100                 105

Phe Ala Trp Gln Leu Met Thr Pro Thr Leu Gly Gly Ser Ile Asn Gln Thr Phe
110                 115                 120                 125

Phe Gln Ser Glu Tyr Asn Pro Thr Val Gln Ala Ala Leu Ala Thr Gly Ala Tyr
        130                 135                 140                 145

Val Ile Val Asp Leu His Asn Tyr Ala Arg Trp Asn Gly Gln Ile Ile Gly Gln
                150                 155                 160

Gly Gly Pro Thr Asn Ala Gln Phe Ala Ser Ile Trp Thr Gln Leu Thr Ser Tyr
    165                 170                 175                 180

Tyr Gly Asn Asn Pro Lys Val Ile Phe Gly Leu Met Asn Glu Pro His Asp Leu
            185                 190                 195

Asn Ser Ile Pro Glu Trp Ala Asp Ser Leu Gln Tyr Val Val Asn Ala Val Arg
200                 205                 210                 215

Ala Ala Gly Ser Thr Asn Tyr Leu Leu Leu Pro Gly Ser Ser Trp Ala Ser Ala
        220                 225                 230                 235

Gln Ala Leu Pro Thr Glu Ala Gly Pro Tyr Leu Leu Gln Ile Thr Asp Pro Leu
                240                 245                 250

Gly Gly Thr Asn Lys Leu Ile Phe Asp Val His Lys Tyr Leu Asp Ser Asp Asn
    255                 260                 265                 270

Ser Gly Thr His Ser Asn Cys Val Thr Asn Asn Thr Gly Val Leu Gln Thr His
            275                 280                 285

Val Thr Trp Leu Gln Gln Asn Gly Asn Arg Gln Ala Leu Leu Ser Glu Thr Gly
290                 295                 300                 305

Gly Gly Ser Ser Asp Ser Ser Cys Glu Thr Tyr Val Ala Gln Glu Leu Ala Phe
        310                 315                 320                 325

Val Gln Ala Asn Lys Asn Asn Ile Ala Gly Phe Ala Ile Trp Ala Ala Gly Ala
                330                 335                 340

Phe Asp Thr Thr Tyr Val Leu Ser Val Thr Pro Asn Ala Asp Gly Ser Asp Gln
    345                 350                 355                 360

Pro Leu Trp Ser Ile Ala Val Lys Pro Tyr Leu Pro
            365                 370
Sequence length: 1372

Molecule type: complementary DNA
Organism: Corticium rolfsii

Feature key: sig peptide
Location: 27..77
Feature key: mat peptide
Location: 78..1196
Feature key: cleavage-site
Location: 318..323
Other information: KpnI
Feature key: cleavage-site
Location: 519..524
Other information: SalI
Feature key: cleavage-site
Location: 840..845
Other information: KpnI
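The feature locations in this listing can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original listing. It assumes the locations are 1-based and inclusive, as is usual in sequence listings, and the names `FEATURES`, `span_length`, `codon_count` and `mature_residue_number` are hypothetical helpers introduced only for illustration.

```python
# Feature locations copied from the listing (assumed 1-based, inclusive).
FEATURES = {
    "sig peptide": (27, 77),
    "mat peptide": (78, 1196),
    "KpnI site 1": (318, 323),
    "SalI site": (519, 524),
    "KpnI site 2": (840, 845),
}


def span_length(start, end):
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


def codon_count(start, end):
    """Number of codons in an interval; the length must be a multiple of 3."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"interval {start}..{end} is not a whole number of codons")
    return length // 3


def mature_residue_number(position, mature_start=78):
    """Residue number (mature peptide numbering) of the codon containing a position."""
    return (position - mature_start) // 3 + 1


signal_residues = codon_count(*FEATURES["sig peptide"])
mature_residues = codon_count(*FEATURES["mat peptide"])
precursor_residues = signal_residues + mature_residues

# Residue at which each 6-nucleotide recognition site begins.
site_residues = {
    name: mature_residue_number(start)
    for name, (start, _end) in FEATURES.items()
    if "site" in name
}

print(signal_residues, mature_residues, precursor_residues)
print(site_residues)
```

Under these assumptions the signal peptide spans 17 codons and the mature peptide 373, giving 390 residues in total, which agrees with the length stated for the amino acid sequence. The three recognition sites begin in the codons for residues 81, 148 and 255 of the mature peptide.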

CTGGTTCTCT CATCTCAACT TACAAA ATG TTC ATT CCC ATT GCA CTA GTT GCA  53
                             Met Phe Ile Pro Ile Ala Leu Val Ala
                                     -15                 -10

TTA GCT GCC AGC GTG GTG AAT GCA CAA CAG TCT GCG TGG GGA CAA TGT GGT  104
Leu Ala Ala Ser Val Val Asn Ala Gln Gln Ser Ala Trp Gly Gln Cys Gly
            -5                  1               5

GGT CAG GGC TGG ACT GGT GCC ACT TCT TGT ATC TCT GGT TAT TAC TGC CAA  155
Gly Gln Gly Trp Thr Gly Ala Thr Ser Cys Ile Ser Gly Tyr Tyr Cys Gln
10                  15                  20                  25

GCG CAG AAC TCT TAC TAT AGT CAA TGT GTT CCT GGA ACT GCA ACT TCG AAG  206
Ala Gln Asn Ser Tyr Tyr Ser Gln Cys Val Pro Gly Thr Ala Thr Ser Lys
            30                  35                  40

ACG GCG AGG TCT ACG TCC ACA GCA CCA AGC AGC ACT GGA AGC TCC GGA GCC  257
Thr Ala Arg Ser Thr Ser Thr Ala Pro Ser Ser Thr Gly Ser Ser Gly Ala
    45                  50                  55                  60

CGT CTG CCA TAT CTC GGT GGT GTA AAC ACG GCT GGT TAT GAT TTC ACC GTG  308
Arg Leu Pro Tyr Leu Gly Gly Val Asn Thr Ala Gly Tyr Asp Phe Thr Val
                65                  70                  75

GAT ACC ACC GGT ACC TTC ACT GGA ACT GGT GTT GTT CCC CCT GCA TCG CAA  359
Asp Thr Thr Gly Thr Phe Thr Gly Thr Gly Val Val Pro Pro Ala Ser Gln
        80                  85                  90

TAT GCT CAC TTT GCC AAC GAG GGT GCC AAT CTC TTC CGT ATT CCT TTC GCT  410
Tyr Ala His Phe Ala Asn Glu Gly Ala Asn Leu Phe Arg Ile Pro Phe Ala
95                  100                 105                 110

TGG CAA TTG ATG ACT CCC ACT CTC GGT GGT AGC ATT AAC CAA ACC TTC TTC  461
Trp Gln Leu Met Thr Pro Thr Leu Gly Gly Ser Ile Asn Gln Thr Phe Phe
            115                 120                 125

CAG TCC GAG TAC AAC CCA ACC GTC CAG GCT GCT CTG GCC ACC GGT GCT TAC  512
Gln Ser Glu Tyr Asn Pro Thr Val Gln Ala Ala Leu Ala Thr Gly Ala Tyr
    130                 135                 140                 145

GTC ATC GTC GAC TTG CAC AAC TAT GCT CGA TGG AAC GGC CAG ATC ATT GGC  563
Val Ile Val Asp Leu His Asn Tyr Ala Arg Trp Asn Gly Gln Ile Ile Gly
                150                 155                 160

CAG GGT GGT CCG ACA AAC GCA CAA TTT GCC TCG ATC TGG ACT CAG CTC ACG  614
Gln Gly Gly Pro Thr Asn Ala Gln Phe Ala Ser Ile Trp Thr Gln Leu Thr
        165                 170                 175

TCC TAC TAC GGC AAC AAC CCT AAA GTC ATC TTT GGC TTG ATG AAC GAG CCT  665
Ser Tyr Tyr Gly Asn Asn Pro Lys Val Ile Phe Gly Leu Met Asn Glu Pro
180                 185                 190                 195

CAT GAT CTC AAT TCA ATC CCC GAG TGG GCG GAC AGT CTC CAA TAC GTC GTC  716
His Asp Leu Asn Ser Ile Pro Glu Trp Ala Asp Ser Leu Gln Tyr Val Val
            200                 205                 210

AAC GCC GTT CGT GCG GCC GGC TCG ACG AAC TAC CTC CTC TTA CCC GGT TCT  767
Asn Ala Val Arg Ala Ala Gly Ser Thr Asn Tyr Leu Leu Leu Pro Gly Ser
    215                 220                 225                 230

TCC TGG GCT AGC GCA CAG GCA CTC CCG ACC GAG GCC GGA CCT TAC CTC CTC  818
Ser Trp Ala Ser Ala Gln Ala Leu Pro Thr Glu Ala Gly Pro Tyr Leu Leu
                235                 240                 245

CAA ATC ACT GAT CCT CTT GGC GGT ACC AAC AAA CTC ATT TTC GAT GTG CAC  869
Gln Ile Thr Asp Pro Leu Gly Gly Thr Asn Lys Leu Ile Phe Asp Val His
        250                 255                 260

AAG TAC CTC GAC AGC GAC AAC AGT GGC ACC CAC TCC AAC TGC GTT ACG AAC  920
Lys Tyr Leu Asp Ser Asp Asn Ser Gly Thr His Ser Asn Cys Val Thr Asn
265                 270                 275                 280

AAC ACT GGT GTC CTT CAG ACG CAC GTG ACC TGG CTC CAG CAG AAT GGC AAC  971
Asn Thr Gly Val Leu Gln Thr His Val Thr Trp Leu Gln Gln Asn Gly Asn
            285                 290                 295

CGT CAG GCG CTT CTG AGC GAG ACT GGT GGA GGT AGC TCT GAC AGC AGT TGC  1022
Arg Gln Ala Leu Leu Ser Glu Thr Gly Gly Gly Ser Ser Asp Ser Ser Cys
    300                 305                 310                 315

GAG ACA TAT GTT GCC CAA GAG CTT GCA TTC GTT CAA GCC AAC AAG AAT AAC  1073
Glu Thr Tyr Val Ala Gln Glu Leu Ala Phe Val Gln Ala Asn Lys Asn Asn
                320                 325                 330

ATT GCC GGC TTT GCC ATC TGG GCC GCA GGT GCA TTC GAC ACG ACA TAC GTC  1124
Ile Ala Gly Phe Ala Ile Trp Ala Ala Gly Ala Phe Asp Thr Thr Tyr Val
        335                 340                 345

CTT AGC GTC ACC CCG AAC GCG GAC GGC TCG GAT CAG CCC CTC TGG TCT ATT  1175
Leu Ser Val Thr Pro Asn Ala Asp Gly Ser Asp Gln Pro Leu Trp Ser Ile
350                 355                 360                 365

GCC GTC AAA CCT TAT CTG CCT TAAGAAACCT AATCTATTCA AAATCACCAA  1226
Ala Val Lys Pro Tyr Leu Pro
            370

TTGAAGTGAA TAAGAGCCAA TGGTTTCTTC TATATATATT TAGTTGTTTA  1276
TCATGGAAAA AAATCCCGTT GTCGTTTGTG TATTAGATAT TTTGGAGAAA  1326
AGTCAATTTA TTTGGTTTCT TGCTTCTCCA AAAAAAAAAA AAAAAA  1372
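The relationship between the nucleotide sequence, its translation and the annotated restriction sites can be illustrated in a few lines of Python. This is a minimal sketch under stated assumptions, not part of the original listing: it uses the standard genetic code, copies only three short excerpts of the sequence (nucleotides 1..104, 309..359 and 513..563, with the OCR misreads `AT6` and `6TC` read as `ATG` and `GTC`), and the names `translate`, `find_sites`, `HEAD`, `KPNI_REGION` and `SALI_REGION` are hypothetical.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Excerpts copied from the listing; spaces are removed after copying.
HEAD = (  # nucleotides 1..104
    "CTGGTTCTCT CATCTCAACT TACAAA "
    "ATG TTC ATT CCC ATT GCA CTA GTT GCA "
    "TTA GCT GCC AGC GTG GTG AAT GCA CAA CAG TCT GCG TGG GGA CAA TGT GGT"
).replace(" ", "")
KPNI_REGION = (  # nucleotides 309..359
    "GAT ACC ACC GGT ACC TTC ACT GGA ACT GGT GTT GTT CCC CCT GCA TCG CAA"
).replace(" ", "")
SALI_REGION = (  # nucleotides 513..563
    "GTC ATC GTC GAC TTG CAC AAC TAT GCT CGA TGG AAC GGC CAG ATC ATT GGC"
).replace(" ", "")


def translate(seq, start):
    """Translate seq from 1-based position start until a stop codon or the end."""
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def find_sites(seq, site, first_position=1):
    """Return the 1-based start positions of every occurrence of site in seq.

    first_position is the listing coordinate of seq[0].
    """
    n = len(site)
    return [i + first_position for i in range(len(seq) - n + 1) if seq[i:i + n] == site]


precursor_start = translate(HEAD, start=27)   # reading frame opens at the ATG
signal_peptide = precursor_start[:17]         # sig peptide 27..77 is 17 codons
mature_start = precursor_start[17:]

print(signal_peptide, mature_start)
print(find_sites(KPNI_REGION, "GGTACC", first_position=309))
print(find_sites(SALI_REGION, "GTCGAC", first_position=513))
```

With these excerpts the translation begins Met Phe Ile Pro Ile in one-letter code (`MFIPI...`), the mature peptide starts with Gln Gln Ser Ala, and the KpnI (GGTACC) and SalI (GTCGAC) recognition sequences are found at positions 318 and 519, matching the cleavage-site locations given in the feature table.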
[0031] (illegible in source)
[0032] (illegible in source)
[11] (a), (b): illegible in source; the legible fragments mention DNA and EcoRI.
[12] (a), (b), (c): illegible in source; the legible fragments mention SDS-PAGE.




(51) Int. Cl.7
(C12N 9/42
C12R 1:69)

(72) Inventor: (name and address illegible in source)
F-terms (reference): 4B024 AA03 AA05 BA12 CA04 DA11 EA04 GA11 HA01
                     4B050 CC03 DD03 LL05